Prediction of functional engrailed homology-1 protein motif from sequence

Author

  • Danielle S Dalafave
Abstract

Prediction of functional peptide motifs from sequences is an important problem in bioinformatics. Experimental discovery of functional sequences is laborious. Searches for specific motifs within the increasing number of proteins available in public databases often involve extensive computer calculations. Short peptide motifs are especially hard to identify via currently available methods. Presented here is a simple and effective procedure to identify a short functional motif. The procedure is based on devising a scoring function using sequence properties. The procedure was applied to the short engrailed homology-1 (eh1)-like motif. The eh1-like motif provides repressive functions by binding to the WD domain of the Gro/TLE transcriptional corepressors. Interactions between known eh1-like variants and the WD domain were modeled and studied. Sequence features crucial for the interactions, and thus for the motif's functionality, were identified. A scoring function was devised based on the observed sequence features. The ability of the scoring function to discriminate between functional and nonfunctional sequences was tested on known eh1-like sequences, random sequences, and eh1-like sequences predicted by others using various bioinformatics tools. The scoring function captured well the general relationship between sequences and their functionality. It gave about 20% false positive findings. However, the scoring function reliably identified sequences that were not functional eh1-like motifs. The procedure presented here may prove useful for predicting functional sequences of other short motifs. Given the importance of transcriptional regulation, this study on the identification and evaluation of functional eh1-like sequences should facilitate further research on their transcriptional roles.
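The abstract does not give the form of the scoring function, so the following is only a minimal sketch of the general idea: score each fixed-width window of a protein sequence against per-position residue rules and report windows above a threshold. The rule table `RULES`, its weights, the window width, the threshold, and the example peptide are all hypothetical choices made for illustration and are not taken from the paper.

```python
from typing import List, Set, Tuple

# HYPOTHETICAL per-position rules for a 7-residue window:
# (set of residues that earn the weight, weight).
# An empty set means the position is not scored.
# These values are illustrative assumptions, not the paper's scoring function.
RULES: List[Tuple[Set[str], float]] = [
    (set("FYW"), 2.0),   # aromatic residue at position 1
    (set(), 0.0),
    (set("ILVM"), 1.5),  # aliphatic residue at position 3
    (set(), 0.0),
    (set(), 0.0),
    (set("ILVM"), 1.5),  # aliphatic residue at position 6
    (set("ILVM"), 1.0),  # aliphatic residue at position 7
]


def score_window(window: str) -> float:
    """Sum the weights of all positions whose residue is in the allowed set."""
    return sum(
        weight
        for residue, (allowed, weight) in zip(window, RULES)
        if residue in allowed
    )


def scan(sequence: str, threshold: float) -> List[Tuple[int, str, float]]:
    """Slide a window over the sequence and keep windows scoring >= threshold.

    Returns (0-based start index, window, score) for each hit.
    """
    width = len(RULES)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = score_window(window)
        if score >= threshold:
            hits.append((start, window, score))
    return hits


if __name__ == "__main__":
    # Arbitrary example sequence and threshold, chosen only to exercise the code.
    for start, window, score in scan("AAFSIDNILAA", threshold=6.0):
        print(start, window, score)
```

In a scheme like this, the threshold trades false positives against missed motifs; tuning it against known functional and random sequences is one way such a function could be evaluated.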

Similar resources

In Silico Analysis of Primary Sequence and Tertiary Structure of Lepidium Draba Peroxidase

Peroxidase enzymes are widely used in industry and diagnosis. Recently, we introduced a new peroxidase gene from Lepidium draba (LDP). According to protein multiple sequence alignment results, LDP had 93% similarity and 88.96% identity with horseradish peroxidase C1A (HRP C1A). In the current study we employed in silico tools to determine to which group of peroxidase enzymes LDP...


In Silico Prediction and Docking of Tertiary Structure of Multifunctional Protein X of Hepatitis B Virus

Hepatitis B virus (HBV) infection is a universal health problem and may result in acute, fulminant, or chronic hepatitis, liver cirrhosis, or hepatocellular carcinoma. The sequence for protein X of HBV was retrieved from the UniProt database. ProtParam from the ExPASy server was used to investigate the physicochemical properties of the protein. Homology modeling was carried out using the Phyre2 server, and refin...


Expression during embryogenesis of a mouse gene with sequence homology to the Drosophila engrailed gene.

Regions of the mouse and human genomes with strong homology to the Drosophila engrailed gene have been identified by Southern blot analysis. One mouse engrailed-like region, Mo-en.1, has been cloned and partially sequenced; homology with the engrailed gene is localized to a 180 bp engrailed-like homeo box and 63 nucleotides immediately 3' to it. The protein sequence this region can encode inclu...


In Silico Analysis of Glutaminase from Different Species of Escherichia and Bacillus

Background: Glutaminase (EC 3.5.1.2) catalyzes the hydrolytic degradation of L-glutamine to L-glutamic acid and has been introduced for cancer therapy in recent years. The present study was an in silico analysis of glutaminase to further elucidate its structure and physicochemical properties. Methods: Forty glutaminase protein sequences from different species of Escherichia and Bacillus obtained...


A Computational Pipeline for High-Throughput Discovery of cis-Regulatory Noncoding RNA in Prokaryotes

Noncoding RNAs (ncRNAs) are important functional RNAs that do not code for proteins. We present a highly efficient computational pipeline for discovering cis-regulatory ncRNA motifs de novo. The pipeline differs from previous methods in that it is structure-oriented, does not require a multiple-sequence alignment as input, and is capable of detecting RNA motifs with low sequence conservation. W...


Journal title:

Volume 4, Issue

Pages -

Publication date: 2009